A Decision Tree Method for Finding and Classifying Names in Japanese Texts

نویسندگان

  • Satoshi Sekine
  • Ralph Grishman
  • Hiroyuki Shinnou
چکیده

This paper describes a system which uses a deci sion tree to nd and classify names in Japanese texts The decision tree uses part of speech character type and special dictionary informa tion to determine the probability that a particu lar type of name opens or closes at a given po sition in the text The output is generated from the consistent sequence of name opens and name closes with the highest probability This system does not require any human adjustment Ex periments indicate good accuracy with a small amount of training data and demonstrate the system s portability The issues of training data size and domain dependency are discussed

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ارزیابی متغیرهای پیش‌آگهی در رده‌بندی نرخ بقای بیماران مبتلا به سرطان کولورکتال با استفاده از درخت تصمیم

Background ; Objectives: Identifying the important influential factors is a great challenge in oncology studies. Decision tree is one of methods that could be used to evaluate the prognostic factors and classifying the patients' homogeneously. This method identifies the main prognostic factors and then determines the subgroups of patients based on those prognostic factors. The aim of this...

متن کامل

سیستم شناسایی و طبقه بندی اسامی در متون فارسی

Name entity recognition (NER) is a system that can identify one or more kinds of names in a text and classify them into specified categories. These categories can be name of people, organizations, companies, places (country, city, street, etc.), time related to names (date and time), financial values, percentages, etc. Although during the past decade a lot of researches has been done on NER in ...

متن کامل

Comparing different stopping criteria for fuzzy decision tree induction through IDFID3

Fuzzy Decision Tree (FDT) classifiers combine decision trees with approximate reasoning offered by fuzzy representation to deal with language and measurement uncertainties. When a FDT induction algorithm utilizes stopping criteria for early stopping of the tree's growth, threshold values of stopping criteria will control the number of nodes. Finding a proper threshold value for a stopping crite...

متن کامل

Classifying the Customers of Telecommunication Company in order to Identify Profitable Customers Based on Their First Transaction, Using Decision Tree: A Case Study of System 780

Effective knowledge and awareness of customers require the market segmentation, through which the customers who have the same needs and purchasing patterns as well as the same response to marketing plans are identified. The selection of a proper variable is a requirement, among other, for a successful market segmentation. In today' world, on one hand, the consumers are bombarded with new goods ...

متن کامل

A New Statistical Approach for Recognizing and Classifying Patterns of Control Charts (RESEARCH NOTE)

Control chart pattern (CCP) recognition techniques are widely used to identify the potential process problems in modern industries. Recently, artificial neural network (ANN) –based techniques are very popular to recognize CCPs. However, finding the suitable architecture of an ANN-based CCP recognizer and its training process are time consuming and tedious. In addition, because of the black box ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998